Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
6417 | 24 | l'1,5 |
6988 | 21 | 5,8 |
8113 | 17 | 1,5 |
8121 | 17 | 7,5 |
8479 | 16 | 5,3 |
8847 | 15 | 0,7 |
9316 | 14 | 0,5 |
9326 | 14 | 3,5 |
9823 | 13 | 2,2 |
10387 | 12 | 0,3 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
43860 | 1 | 11(novembre |
46628 | 1 | Bellpuig(Urgell |
47424 | 1 | Catalana(ANC |
47440 | 1 | Catalunya(MNAC |
48710 | 1 | Droite(s |
49531 | 1 | Festival(MICFF |
61386 | 1 | comunicat(pdf |
66482 | 1 | empreses(start-ups |
68049 | 1 | extreme(s |
76594 | 1 | pàgines(pdf |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
47087 | 1 | CM2).van |
48787 | 1 | EDADES)1 |
53860 | 1 | PP)i |
55189 | 1 | Rodamón)… |
55372 | 1 | S)movies |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
43577 | 1 | -13%- |
43579 | 1 | -14%- |
43581 | 1 | -15%- |
43898 | 1 | 12%-14 |
44374 | 1 | 20%- |
44643 | 1 | 3%-4 |
44959 | 1 | 5%i |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
22395 | 4 | UOC&Plugged |
31815 | 2 | AT&T |
32044 | 2 | B&H |
34906 | 2 | S&P |
49582 | 1 | Fischli&Weiss |
51274 | 1 | Kids&Us |
52359 | 1 | M&A |
55182 | 1 | Rock&LLamp |
56361 | 1 | Tasta&Jazz |
56583 | 1 | Tom&Jerry |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
62469 | 1 | d'"Independència |
62470 | 1 | d'"exterminar |
62471 | 1 | d'"hipòcrita |
62472 | 1 | d'"oportunisme |
62473 | 1 | d'"un |
70650 | 1 | l"aeroport |
70651 | 1 | l'"Holanda |
70652 | 1 | l'"abrandada |
70653 | 1 | l'"aeroport |
70654 | 1 | l'"esperit |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
57 | 2578 | s'ha |
79 | 2009 | d'un |
82 | 1840 | d'una |
101 | 1464 | l'estat |
138 | 1074 | s'han |
166 | 916 | l'any |
224 | 701 | s'hi |
232 | 675 | d'aquest |
280 | 586 | d'euros |
305 | 554 | d'aquesta |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
33437 | 2 | I+D+i |
34846 | 2 | Rio+20 |
46794 | 1 | Bloc+Iniciativa |
47112 | 1 | CTRL+C |
47113 | 1 | CTRL+V |
48801 | 1 | ERC+DCat+Rcat |
48802 | 1 | ERC+Rcat+DC |
49711 | 1 | Foster+Partners |
50575 | 1 | I+D |
53861 | 1 | PP+Ciutadans |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
9323 | 14 | 2/4 |
16515 | 6 | 3/24 |
17866 | 6 | km/h |
25090 | 4 | µg/m3 |
29204 | 3 | i/o |
31449 | 2 | 1774/2004 |
31613 | 2 | 3/1986 |
31614 | 2 | 3/4 |
31683 | 2 | 5/2012 |
33196 | 2 | GNU/Linux |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing forbidden characters do not have a very low frequency, there might be a problem in preprocessing.
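
The check described above can be sketched in a few lines of Python. This is only an illustration: the function name `flag_suspicious`, the `allowed` set and the `max_freq` threshold are assumptions, since the text does not say what counts as "very low frequency" or which characters a given language permits. The sample rows reuse words from the tables in this section.

```python
from collections import defaultdict

# The special characters listed in this subsection.
SPECIAL_CHARS = ",()%&$\"'+*=/_"


def flag_suspicious(words, allowed, max_freq=5):
    """Group words by the forbidden special character they contain.

    words    -- iterable of (word, freq) pairs
    allowed  -- characters considered legitimate inside words (assumption)
    max_freq -- hypothetical cutoff; only words with freq above it are flagged
    """
    flagged = defaultdict(list)
    for word, freq in words:
        for ch in SPECIAL_CHARS:
            if ch in word and ch not in allowed and freq > max_freq:
                flagged[ch].append((word, freq))
    return dict(flagged)


# Sample rows taken from the tables in this section.
sample = [("s'ha", 2578), ("l'1,5", 24), ("5%i", 1), ("AT&T", 2)]

# Assumption: the apostrophe is allowed inside words, everything else is not.
result = flag_suspicious(sample, allowed={"'"}, max_freq=5)
print(result)
```

With these settings only `l'1,5` is reported, under the comma: it contains a character that is not allowed and its frequency is above the cutoff. A different cutoff or allowed set would change the result.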
Example query, here for words containing %:
-- Rank is w_id - 100; entries with w_id <= 100 are excluded.
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
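
The same query can be repeated for each of the listed characters. The sketch below is one possible way to do that, under stated assumptions: it uses Python's `sqlite3` with an in-memory table as a stand-in for the real database (the original query looks like MySQL syntax), the helper name `words_containing` is hypothetical, and the column types are guessed. The rows are toy data: ranks, frequencies and words come from the tables above, `w_id` is derived as rank + 100, and the first row is invented to show the `w_id > 100` filter. An `ORDER BY` is added so the output order is deterministic.

```python
import sqlite3


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words that contain `char`."""
    # % and _ are LIKE wildcards, so they (and the escape character itself)
    # must be escaped before being embedded in the pattern.
    escaped = (
        char.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    )
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (f"%{escaped}%", limit)).fetchall()


# Toy stand-in for the real wordlist table (schema is an assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
)
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (1, "%", 900),        # invented row, excluded by w_id > 100
        (157, "s'ha", 2578),
        (44474, "20%-", 1),
        (45059, "5%i", 1),
        (52459, "M&A", 1),
    ],
)

rows = words_containing(conn, "%")
for rank, freq, word in rows:
    print(rank, freq, word)
```

Passing the character as a bound parameter avoids quoting problems with `"` and `'`, which are themselves among the characters being searched for.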
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots